

# Lecture 30

#### **Code Generation**

Awanish Pandey

Department of Computer Science and Engineering Indian Institute of Technology Roorkee



• a+e\*(c+d)



- a+e\*(c+d)
- Minimum registers required for correct execution?



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$
- **6**  $R_2 = e$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$
- $R_1 = R_1 + R_2$
- **6**  $R_2 = e$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- 2  $R_1 = c$
- **3**  $R_2 = d$
- $R_1 = R_1 + R_2$
- **6**  $R_2 = e$



- a+e\*(c+d)
- Minimum registers required for correct execution?
- **1**  $R_0 = a$

**1**  $R_0 = c$ 

- ②  $R_1 = c$
- **3**  $R_2 = d$
- $Q R_1 = R_1 + R_2$
- **6**  $R_2 = e$
- $R_2 = R_2 * R_1$



• Minimum registers required for correct execution?

**1** 
$$R_0 = a$$

**1** 
$$R_0 = c$$

② 
$$R_1 = c$$

② 
$$R_1 = d$$

$$R_2 = d$$

$$R_1 = R_1 + R_2$$

**6** 
$$R_2 = e$$

$$R_2 = R_2 * R_1$$



• Minimum registers required for correct execution?

**1** 
$$R_0 = a$$

② 
$$R_1 = c$$

**3** 
$$R_2 = d$$

$$R_1 = R_1 + R_2$$

**6** 
$$R_2 = e$$

**1** 
$$R_0 = c$$

② 
$$R_1 = d$$



- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$
- $R_1 = R_1 + R_2$
- **6**  $R_2 = e$

**1** 
$$R_0 = c$$

② 
$$R_1 = d$$

**4** 
$$R_1 = e$$



- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$
- Q  $R_1 = R_1 + R_2$
- **6**  $R_2 = e$

**1** 
$$R_0 = c$$

② 
$$R_1 = d$$

**4** 
$$R_1 = e$$



- Minimum registers required for correct execution?
- **1**  $R_0 = a$
- ②  $R_1 = c$
- **3**  $R_2 = d$
- $R_1 = R_1 + R_2$
- **6**  $R_2 = e$

**1** 
$$R_0 = c$$

② 
$$R_1 = d$$

**4** 
$$R_1 = e$$

$$R_1 = R_1 * R_0$$

**1** 
$$R_0 = a$$



• Minimum registers required for correct execution?

**1** 
$$R_0 = a$$

② 
$$R_1 = c$$

**3** 
$$R_2 = d$$

$$Q R_1 = R_1 + R_2$$

**6** 
$$R_2 = e$$

$$Q R_0 = R_0 + R_2$$

**1** 
$$R_0 = c$$

② 
$$R_1 = d$$

$$Q R_1 = e$$

$$R_1 = R_1 * R_0$$

**1** 
$$R_0 = a$$



• When should we do?



- When should we do?
  - ► Instruction Selection



- When should we do?
  - ► Instruction Selection
  - ► Instruction Reordering



- When should we do?
  - ► Instruction Selection
  - ► Instruction Reordering
- Type of allocator



- When should we do?
  - ► Instruction Selection
  - ► Instruction Reordering
- Type of allocator
  - Using usages count



- When should we do?
  - ► Instruction Selection
  - Instruction Reordering
- Type of allocator
  - Using usages count
  - ► Graph coloring based (Near optimal/ Build interferance graph)



- When should we do?
  - ► Instruction Selection
  - Instruction Reordering
- Type of allocator
  - Using usages count
  - ► Graph coloring based (Near optimal/ Build interferance graph)
  - ► linear scan (Fast)



- When should we do?
  - ► Instruction Selection
  - Instruction Reordering
- Type of allocator
  - Using usages count
  - Graph coloring based (Near optimal/ Build interferance graph)
  - ▶ linear scan (Fast)
  - ► Local register allocation







• Visibility of only current instruction.



- Visibility of only current instruction.
- DAG of the basic block will give more visibility



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - ▶ It stores data dependencies



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - ▶ It stores data dependencies
  - ▶ Helps in common sub-expression elimination



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - It stores data dependencies
  - ▶ Helps in common sub-expression elimination
  - Useful in identifying code generation order



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - It stores data dependencies
  - ▶ Helps in common sub-expression elimination
  - Useful in identifying code generation order
  - Useful in instruction reordering



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - ► It stores data dependencies
  - ▶ Helps in common sub-expression elimination
  - Useful in identifying code generation order
  - Useful in instruction reordering
- Code generation based on DAG



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - ► It stores data dependencies
  - ► Helps in common sub-expression elimination
  - Useful in identifying code generation order
  - Useful in instruction reordering
- Code generation based on DAG
  - Perform topological numbering for the DAG nodes



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - It stores data dependencies
  - ▶ Helps in common sub-expression elimination
  - Useful in identifying code generation order
  - Useful in instruction reordering
- Code generation based on DAG
  - Perform topological numbering for the DAG nodes
  - Generate code for nodes in reverse topological ordering



- Visibility of only current instruction.
- DAG of the basic block will give more visibility
  - ► It stores data dependencies
  - ▶ Helps in common sub-expression elimination
  - Useful in identifying code generation order
  - Useful in instruction reordering
- Code generation based on DAG
  - Perform topological numbering for the DAG nodes
  - Generate code for nodes in reverse topological ordering
  - Optimal code generation is NP-hard





• Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ► Label all leaves as 1.



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ► Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ▶ Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ▶ Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children
    - ★ Larger of the label of its children if those labels are different.



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ► Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children
    - ★ Larger of the label of its children if those labels are different.
    - ★ One plus the label of its children if both the labels are same.



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ► Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children
    - ★ Larger of the label of its children if those labels are different.
    - ★ One plus the label of its children if both the labels are same.
- Recursive algo, start from root node. If a node is labelled as k then only k registers will be used



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ▶ Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children
    - ★ Larger of the label of its children if those labels are different.
    - ★ One plus the label of its children if both the labels are same.
- Recursive algo, start from root node. If a node is labelled as k then only k registers will be used
- There is a "base" b  $\geq 1$  for registers use. so that actual registers used are  $R_b, R_{b+1}, \cdots, R_{b+k-1}$  and the result always appears in  $R_{b+k-1}$



- Ershov Numbers: Numbers of registers needed to evaluate a node without storing any temporaries.
  - ► Label all leaves as 1.
  - ▶ The label of an interior node with one child is the label of its child
  - ▶ The label of an interior node with two children
    - ★ Larger of the label of its children if those labels are different.
    - ★ One plus the label of its children if both the labels are same.
- Recursive algo, start from root node. If a node is labelled as k then only k registers will be used
- There is a "base" b  $\geq 1$  for registers use. so that actual registers used are  $R_b, R_{b+1}, \cdots, R_{b+k-1}$  and the result always appears in  $R_{b+k-1}$
- For a leaf operand x, if base is b generate the instruction LD  $R_h$ , x

